NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Discourse Analysis via Questions and Answers: Parsing Dependency Structures of Questions Under Discussion

Ko, Wei-Jen; Wu, Yating; Dalton, Cutter; Srinivas, Dananjay; Durrett, Greg; Li, Junyi Jessy (July 2023, Findings of the Association for Computational Linguistics: ACL 2023)

Automatic discourse processing is bottlenecked by data: current discourse formalisms pose highly demanding annotation tasks involving large taxonomies of discourse relations, making them inaccessible to lay annotators. This work instead adopts the linguistic framework of Questions Under Discussion (QUD) for discourse analysis and seeks to derive QUD structures automatically. QUD views each sentence as an answer to a question triggered in prior context; thus, we characterize relationships between sentences as free-form questions, in contrast to exhaustive fine-grained taxonomies. We develop the first-of-its-kind QUD parser that derives a dependency structure of questions over full documents, trained using a large, crowdsourced question-answering dataset DCQA (Ko et al., 2022). Human evaluation results show that QUD dependency parsing is possible for language models trained with this crowdsourced, generalizable annotation scheme. We illustrate how our QUD structure is distinct from RST trees, and demonstrate the utility of QUD analysis in the context of document simplification. Our findings show that QUD parsing is an appealing alternative for automatic discourse processing.
more » « less
Full Text Available
Discourse Comprehension: A Question Answering Framework to Represent Sentence Connections

Ko, Wei-Jen; Dalton, Cutter; Simmons, Mark; Fisher, Eliza; Durrett, Greg; Li, Junyi Jessy (December 2022, Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing)

While there has been substantial progress in text comprehension through simple factoid question answering, more holistic comprehension of a discourse still presents a major challenge (Dunietz et al., 2020). Someone critically reflecting on a text as they read it will pose curiosity-driven, often open-ended questions, which reflect deep understanding of the content and require complex reasoning to answer (Ko et al., 2020; Westera et al., 2020). A key challenge in building and evaluating models for this type of discourse comprehension is the lack of annotated data, especially since collecting answers to such questions requires high cognitive load for annotators. This paper presents a novel paradigm that enables scalable data collection targeting the comprehension of news documents, viewing these questions through the lens of discourse. The resulting corpus, DCQA (Discourse Comprehension by Question Answering), captures both discourse and semantic links between sentences in the form of free-form, open-ended questions. On an evaluation set that we annotated on questions from Ko et al. (2020), we show that DCQA provides valuable supervision for answering open-ended questions. We additionally design pre-training methods utilizing existing question-answering resources, and use synthetic data to accommodate unanswerable questions.
more » « less
Full Text Available
Assessing Discourse Relations in Language Generation from GPT-2

Ko, Wei-Jen; Li, Junyi Jessy (December 2020, Proceedings of the 13th International Conference on Natural Language Generation)
null (Ed.)
Full Text Available
Domain Agnostic Real-Valued Specificity Prediction

https://doi.org/10.1609/aaai.v33i01.33016610

Ko, Wei-Jen; Durrett, Greg; Li, Junyi Jessy (July 2019, Proceedings of the AAAI Conference on Artificial Intelligence)

Sentence specificity quantifies the level of detail in a sentence, characterizing the organization of information in discourse. While this information is useful for many downstream applications, specificity prediction systems predict very coarse labels (binary or ternary) and are trained on and tailored toward specific domains (e.g., news). The goal of this work is to generalize specificity prediction to domains where no labeled data is available and output more nuanced realvalued specificity ratings.We present an unsupervised domain adaptation system for sentence specificity prediction, specifically designed to output real-valued estimates from binary training labels. To calibrate the values of these predictions appropriately, we regularize the posterior distribution of the labels towards a reference distribution. We show that our framework generalizes well to three different domains with 50%-68% mean absolute error reduction than the current state-of-the-art system trained for news sentence specificity. We also demonstrate the potential of our work in improving the quality and informativeness of dialogue generation systems.
more » « less
Full Text Available
Inquisitive Question Generation for High Level Text Comprehension

https://doi.org/10.18653/v1/2020.emnlp-main.530

Ko, Wei-Jen; Chen, Te-yuan; Huang, Yiyan; Durrett, Greg; Li, Junyi Jessy (January 2020, Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP))
null (Ed.)
Full Text Available
Linguistically-Informed Specificity and Semantic Plausibility for Dialogue Generation

https://doi.org/10.18653/v1/N19-1349

Ko, Wei-Jen; Durrett, Greg; Li, Junyi Jessy (January 2019, Proceedings of the 2019 Conference of the North {A}merican Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers))

Sequence-to-sequence models for open-domain dialogue generation tend to favor generic, uninformative responses. Past work has focused on word frequency-based approaches to improving specificity, such as penalizing responses with only common words. In this work, we examine whether specificity is solely a frequency-related notion and find that more linguistically-driven specificity measures are better suited to improving response informativeness. However, we find that forcing a sequence-to-sequence model to be more specific can expose a host of other problems in the responses, including flawed discourse and implausible semantics. We rerank our model{'}s outputs using externally-trained classifiers targeting each of these identified factors. Experiments show that our final model using linguistically motivated specificity and plausibility reranking improves the informativeness, reasonableness, and grammatically of responses.
more » « less
Full Text Available

Search for: All records